New technology for raster document image compression
نویسندگان
چکیده
This paper describes in detail the LuraDocument technique, a recently developed, high performance technique for compressing and archiving scanned documents, particularly those containing text and image. LuraDocument offers higher compression rates and quality in comparison to traditional document compression methods, preserving text legibility even at extremely high compression rates. This various stages of LuraDocument compression are described in detail, including image quantization and text detection procedures.
منابع مشابه
JPEG2000: An Open Standard for Image Compression
LINK TO PAPER JPEG2000: An Open Standard for Image Compression Track: New Technology and Technology Integration Author(s): Walt Wiley The emerging JPEG2000 standard holds significant promise for the GIS community with regard to the compression and distribution of raster data. The standard describes features such as fully lossless compression, region of interest encoding and support for hyperspe...
متن کاملSegmentation and compression of documents with JPEG2000
We review the standard JPEG2000 for still image compression and mention some typical applications. Special weight is put onto the core coding system described in Part 1 and the compound image file format for document imaging described in Part 6 including a section on image segmentation. Index Terms — JPEG2000, still image compression, mixed raster graphics, segmentation
متن کاملMarkov Random Field Model Based Text Segmentation and Image Post Processing of Complex Scanned Documents
Haneda, Eri Ph.D., Purdue University, May 2011. Markov Random Field Model Based Text Segmentation and Image Post Processing of Complex Scanned Documents. Major Professor: Charles A. Bouman. In this dissertation, two image processing studies will be presented. The first study is segmentation for MRC document compression using an MRF model, and the second study is an automatic contrast enhancemen...
متن کاملCompression of Compound Documents
Compound (or mixed) document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites etc. Because of the very distinct nature of those two image classes (text/graphics vs. pictures), their compression invariably involves multiple compression systems and a region segmentation (classification) method. We rev...
متن کاملDocument Compression Using H.264/AVC
It has been verified that H.264/AVC, the newest video compression standard, can also be used to encode still images. In many cases, it outperforms state-of-art coders such as JPEG2000. For compound documents, the gains over JPEG2000 are even more expressive. In this scenario, the contributions of the present paper are distributed over four document encoding methods that use the H.264/AVC as a b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000